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Abstract 



o 
O 

The Random Demodulator (RD) and the Modulated Wideband Converter (MWC) are two re- 
cently proposed compressed sensing (CS) techniques for the acquisition of continuous-time spectrally- 
sparse signals. They extend the standard CS paradigm from sampling discrete, finite dimensional 
signals to sampling continuous and possibly infinite dimensional ones, and thus establish the ability 
to capture these signals at sub-Nyquist sampling rates. The RD and the MWC have remarkably 
similar structures (similar block diagrams), but their reconstruction algorithms and signal models 
, strongly differ. To date, few results exist that compare these systems, and owing to the potential 

impacts they could have on spectral estimation in applications like electromagnetic scanning and 
cognitive radio, we more fully investigate their relationship in this paper. We show that the RD 
and the MWC are both based on the general concept of random filtering, but employ significantly 
different sampling functions. We also investigate system sensitivities (or robustness) to sparse signal 
model assumptions. Lastly, we show that "block convolution" is a fundamental aspect of the MWC, 
allowing it to successfully sample and reconstruct block-sparse (multiband) signals. Based on this 
concept, we propose a new acquisition system for continuous-time signals whose amplitudes are block 
sparse. The paper includes detailed time and frequency domain analyses of the RD and the MWC 
\ , that differ, sometimes substantially, from published results. 

a, 

1 Introduction 

The theory of compressed sensing (CS) says that if a signal is sufficiently sparse with respect to some basis 
or frame, it can be faithfully reconstructed from a small set of linear, nonadaptive measurements [1^3]. 
When the signal belongs to a finite dimensional space, this statement means that it can be reconstructed 
from a set of measurements whose cardinality may be significantly less than the space's dimension. It also 
implies that the measurement process is described by an underdetermincd linear system of equations, or 
equivalently, by a rectangular matrix with more columns than rows. The fundamental work of Candcs, 
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Romberg, and Tao [4] and Donoho p] established sufficient conditions upon such sensing matrices, that 
if satisfied, allow the stable inversion of the linear systems. A key aspect of CS, and one which plays an 
important role in this paper, is that sensing matrices drawn at random^] often satisfy these conditions. 

Conceptually, CS theory has three main thrusts: (1) the development of recovery methods that 
efficiently and faithfully reconstruct the original signal from its compressed samples, (2) the investigation 
of new signal models that effectively represent signal sparsity or other signal structure, and (3) the 
creation of new sampling (measurement) mechanisms that acquire signals in a compressed manner. The 
first concerns the reconstruction process and asks how one specifically reconstructs the original signal 
from the CS measurements (see, e.g., [1)4,5 ). The second concerns the examination of different signal 
classes of interest and asks if are structured representations that can be exploited [5HS]. The third 
concerns the design of the physical sampling system and asks how one devises a system to acquire CS 
measurements [7ll9Ullj. This paper focuses on the third thrust and examines two sampling systems for 
two distinct, but related, signal models. 

Several CS based signal acquisition systems have been proposed for both continuous (analogue) and 
discrete signals. For example, the single-pixel camera |12j is a novel compressive imaging system, where 
light is projected onto a random basis using a micro-mirror device, and then the projected image is 
captured by a single photo-diode (the single "pixel" ) . Other examples include random filtering [13] and 
random convolution [5] that advocate random linear filtering and low rate sampling as a means to collect 
CS measurements. In these cases, "random" filters are linear filters whose impulse responses are realisa- 
tions of particular random processes. Along the same lines, the Random Demodulator (RD) 10, 14"lll5] 
and the Modulated Wideband Converter (MWC) [TT1ITMT5] have recently been proposed as CS sampling 
systems that target continuous-time spectrally-sparse signals. The RD is a single channel, uniform 
sub-Nyquist sampling strategy for acquiring sparse multitone signals; the MWC is a multi-channel, 
uniform sub-Nyquist sampling strategy for acquiring sparse multiband signals. (Precise definitions for 
these two signal classes are provided in Sectional) The RD and the MWC have tremendous potential 
impact because of the longstanding, proven usefulness of spectral signal models in many engineering and 
scientific applications (e.g. communications, radar/sonar, medical imaging, etc.). Perhaps owing to the 
near coincidental emergence of these systems, few results exist to date that reconcile their remarkably 
similar structures (see Figure [I} with their different reconstruction algorithms. In fact, the current 
literature paints a somewhat artificial dividing line between the RD and the MWC, preferring to focus 
primarily on one scheme or another rather than drawing connections between them. One exception is 
the recent comparative analysis by Mishali et al. [20 that focuses on the systems' robustness to signal 
model perturbations and computational and hardware complexity. We comment more on 20J below. 

In this paper, we offer new insights into the relationship between the RD and the MWC that com- 
plement the original works of Tropp et al. [10] and Mishali and Eldar [11] . We apply tools from modern 
sampling theory and classical Fourier analysis and show that the RD and the MWC are two manifes- 

1 There are several ways to construct viable random sensing matrices. For example, its entries could simply be indepen- 
dent and identically distributed realisations of a a zero mean, unit variance Gaussian random variable. 

2 An early multi-channel random demodulator was proposed in 1191 . 
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tations of the same CS sampling approach, namely random filtering/convolution 9, 13 . This fact is 
reflected the systems' similar structure. At the same time, we show that the sampling functions char- 
acterising the systems strongly distinguish the two schemes. In Section [3] we examine three different 
properties of the RD and the MWC related to the underlying assumption on signal sparsity. We dis- 
cuss how sparsity manifests itself in each case and comment on the system's sensitivity or robustness to 
changes in sparsity. In Section^ we highlight, among other insights, the MWC's use of block convolution 
as a principal processing step that enables it to successfully sample and recover "block-sparse" signals, 
i.e. signals whose nonzero components are grouped together. Extending this idea, we propose a new CS 
based sampling system and show through an example that it can successfully sample and reconstruct 
continuous-time signals that are block sparse in the time domain. 

To be clear, we do not discuss the conditions of successful reconstruction, nor implementation issues in 
this paper. The original works of Tropp et al. jlOj and Mishali and Eldar [11) . and even some subsequent 
scholarship 16-18 , extensively investigate these issues. Some of the reconstruction conditions will be 
stated in the descriptions of the systems in Section [3] but the presumption throughout the paper is 
that the RD and the MWC are theoretically proven CS based techniques to sample and reconstruct 
continuous-time spectrally-sparse signals. In addition, we only examine the idealised RD and MWC 
because their fundamental similarities and differences are sharper in this setting (e.g., one does not have 
to account for the effects of non- ideal filters) . Aspects of practical implementations are discussed in [II] 
and [H]. 

This paper and the comparative analysis presented in [20] are similar in some respects; however, the 
approach and the conclusions are very different. For example, both touch on a model sensitivity issue of 
the RD, but in [20J this sensitivity is billed as a fundamental shortcoming in comparison to the MWC 
because it does not exhibit the exact same sensitivity. In contrast, we argue here that the sensitivity is a 
manifestation of a CS sensitivity and that the MWC also inherits a shortcoming from CS theory, albeit a 
different one than the RD. In short, the approach of the present paper asks if there is an underlying link 
in the way the RD and the MWC process their signals and then examines their similarities and differences 
from this common perspective. This perspective yields a consistent and broader understanding of these 
systems and their relation to other CS schemes and standard sampling theory. 

Throughout the paper, we denote time domain signals by lower case letters (e.g. x, y, tp) and frequency 
domain signals by upper case letters (e.g. X,Y). Vectors and matrices are indicated by boldface type 
(e.g. x, Y, <&). Parameters are denoted by upper case letters with one exception: the number of channels 
in the MWC is denoted by q' . 

Contributions. The main contributions of this paper are: 

• a consistent analysis that (i) clearly shows random filtering underlies the RD and the MWC and 
(ii) highlights system sensitivities 

• the insight that block convolution fundamentally enables the MWC to sample and recover frequency 
block-sparse signals, and the generalization of this idea to a new sampling system. 
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Figure 1: Time domain block diagrams of the random demodulator (RD) and the modulated wideband converter 
(MWC). The RD is characterised by T, the duration of the observation interval and M, a sampling rate parameter. 
The MWC is characterised by L' , a parameter for the period of Pi(i), M' , a sampling rate parameter, and q' , the 
number of channels. The primary structural difference between the systems is the type of filter employed prior 
to sampling — the RD uses an ideal integrator and the MWC uses ideal low pass filters. 



2 Sampling mechanisms and signal models 

In this section, we examine the sampling mechanisms of the RD and the MWC from a modern sampling 
theory perspective. We show the output samples for both systems are equal to the inner products of the 
input signal with a set of sampling functions that arise from the systems' designs. We observe that unlike 
typical sampling functions, these sampling functions involve random waveforms, a central component in 
many CS sampling systems. If the inner products are interpreted as analogue filtering operations, we 
show that the samples result from a generalised random filtering or random convolution as described by 
Romberg [S] and Tropp et al. [13] as a means to acquire CS samples. This analysis suggests that the RD 
and the MWC are two manifestations of the same sampling approach, but differ in the specific form of 
the sampling functions. The difference in sampling functions also reflects the difference in the assumed 
signal models for the RD and the MWC. We do not introduce the notion of signal sparsity in this section 
because the conclusions reached do not depend on this aspect. Signal sparsity and its consequences are 
discussed in Section [3j 



2.1 Sampling with the random demodulator 

Let x(t) be a continuous-time, complex-valued signal defined on the real line. The RD acquires samples 
of x(t) on a finite observation interval where here, we assume, without loss of generality, that the samples 
are collected in the interval [0,T] seconds. In [TO], Tropp et al. adopt a particular signal model for x(t) 
on this interval. They assume in part that x(t) has a Fourier series (FS) expansion on [0,T] which has 
bounded harmonics, i.e., — W/2 < ^ < W/2 Hz for neZ. On this interval, x{t) is therefore modeled as 

AT/2-1 

x(t)= J2 X(n)e^\ te[0,T], (1) 

n=-N/2 



4 



Reconciling CS Sampling 

where {X(n)} denotes the FS coefficients of x(t) and N = TW. For ease of exposition, TV is assumed 
to be an even positive integer. This signal model is often called a multitone model. 

To acquire the samples, a RD first multiplies x(t) by a waveform p{t) and then filters and samples the 



product x(t)p(t) on [0,T] (see Figure 1(a) I. The signal p{t) is taken to be a realisation of a continuous 
random process derived from a vector of Bernoulli random variables. Let Z = [Zq, . . . , Z^-i] be a vector 
of independent and identically distributed Bernoulli random variables Zi taking values ±1 with equal 
probabilitj|f| and let p(t; Z) denote the random process 

I i + r 



p(t;Z) = Z u te 



W W 



l = 0,...,N-l. (2) 



A realisation Zq of Z produces a single realisation p(t; Zo) of p(t; Z). Here, we abbreviate p(t; Zq) by p(t) 
and thus consider p(t) to be a deterministic quantity, although its randomness plays an important role in 
proving performance guarantees [10] , In this paper, we sometimes refer to p(t) as a random waveform in 
deference to this point. We stress that when acquiring samples on [0,T], the RD uses a single realisation 
of p(t; Z), but different realisations may be used for other observation intervals. Note also that p(t) has 
the FS representation, 

oo 

p(t)= £ P(n)e^ nt , te[0,T] (3) 

n— — oo 

where {P(n)} is the set of FS coefficients of p(t). 

The analogue filter in the RD design is taken to be an ideal integrator with impulse response h(t) = 
rect(^i - l), where 

!1 for - 1 < x < 1 
, (4) 
otherwise 

and M G Z + . The sampling period T s is taken to be M times shorter than the observation window 
(T s = T/M). The system therefore samples at the rate of M/T Hz. The multitone signal model and 
the RD sampling system are therefore parameterised by N, the parameter equal to the time-frequency 
product TW and M, the parameter that controls the RD's sampling rate. Here, we assume that M < N. 

The goal of the RD is to sample x(t) at low rates while retaining the ability to reconstruct it in 
the interval [0,T]. Reconstruction entails the discovery of the active frequencies (the signal's spectral 
support) and the amplitude of the corresponding FS coefficients. If x(t) is spectrally sparse on [0, T], 
then reconstruction is possible using CS algorithms |10] . In this case, we note that signal reconstruction 
only implies the recovery of the spectral content of x(t) in the observation interval. In other words, the 
samples y{k) 1 k = 0, . . . ,M — 1, do not convey information about the spectral content of x(t) outside 
of this interval. To obtain spectral information outside of [0, T], the RD must be applied to other 
intervals (of possibly different durations). If the RD is applied to consecutive intervals, a time-frequency 
decomposition of x(t) similar to the short-time Fourier transform can be obtained for multitone signals. 



Time domain description. By inspection of Figure 1(a) the output samples y(k) can be expressed 



3 A ±1 symmetric Bernoulli random variable is also known as a Rademacher random variable. 
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as 



y(fc)=s((k + l)&) 

(5) 



/ x{T)p(T)vect(^(t - r) - 1) dr 
Jo 



*=( fe + 1 )w 



for = 0, . . . , M — 1 where g(t) = x(t)p(t) * h(t) (* denotes convolution). By substituting (JlJ into this 
expression and evaluating the integral, the following equation relating the time domain samples y(k) to 
the FS coefficients X(n) results: 

N/2—1 (j k + 1 )jj- 1 
y(k) = ]T a(n)X(n) £ Pl e^ n \ (6) 

n=-N/2 N_ 
l - K M 

for k = 0, . . . M — 1 where p/ = p(l/W) and 

a(n) = { j2nn 

1/W n = 

Tropp et al. derived ([6|) in [10] by analysing an equivalent digital system. In Appendix 16.11 we provide 
an alternate derivation that explicitly shows the analogue processing inherent in sampling with the RD. 

Because sampling is a linear operation with the RD, the samples y(k) can be viewed as inner products 
of x(t) with the set of sampling functions {p(r)rect(2fc + 1 — ^-t)} where 

y(k) = (x(T),p(T)rect(2fe + 1 - ^r)>, (7) 

for k = 0, . . . , M - 1 and 

(x(t),s(t)) = I x(t)s*(t) dt, 
Jo 

for two continuous functions x(t), s(t) on [0,T]. These sampling functions have finite duration in time 
(T/M seconds), but because their Fourier transforms involve sine functions, they extend infinitely in 
frequency. In the time-frequency plane, their support partitions the space into vertical strips of equal 
width (see Figurc[2j left panel). We note that unlike modern sampling theory [22], the sampling functions 
in ([7]) contain the random waveform p(t), and the conditions they must satisfy to ensure stable recovery 
is governed by CS theory and not Shannon-Nyquist based sampling theory. (Refer to [35] and [7] for 
details regarding the conditions that sampling functions typically must satisfy.) 

From it is clear the samples y(k) can be thought of as pointwise evaluations of the convolution 
between x(t)p(t) and an ideal integrator. Equally valid, however, is the view that the samples are 
pointwise evaluations of a random, linear filtering operation involving x(t) and the time- varying analogue 
filter h(t,r) =p(r)rect(2M(t-T) - l), 



y(k) = [ x(T)h(t,T) dr 



(8) 
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Figure 2: The output samples of both the RD and the MWC can be described as inner products of the input 
signal x(t) with certain sets of sampling functions. The panel on the left depicts the time-frequency support of 
the RD sampling functions where each vertical strip represents the support of one sampling function. Similarly, 
the panel on the right depicts the support of the MWC sampling functions where each horizontal strip represents 
the support of one sampling function. For the RD and the MWC, the support characteristics of the sampling 
functions directly derive from the type of analogue filters used prior to sampling. The RD and the MWC 
represent two extreme cases: The RD has perfectly localised support in time but completely unlocalized support 
in frequency. The MWC is the exact opposite. 



for k = 0, ...,M — 1. Here, the impulse response h(t,r) is considered random because at each time 
instance it is a windowed portion of a signal that randomly alternates between ±1. The samples y(k) 
can therefore be thought of as the result of a random filtering operation, conceptually similar to the 
random filtering schemes proposed in [9] and [I5j . In [13] . Tropp et al. proposed a CS sampling scheme 
where a sparse discrete-time signal is first filtered by a digital filter whose impulse response is a realisation 
of a sequence of independent and identically distributed random variables, and then subsampled at a low 
rate. They illustrated through examples that with the use of CS recovery algorithms random filtering 
is a potential sampling structure to acquire CS measurements for sparse discrete time signals. In [9], 
Romberg proposed and examined a similar idea but considered a specific digital filter that randomly 
changes the phase of the input signal. Interestingly, Romberg considered the RD as a separate, follow- 
on processing step to his approach instead of considering it as a generalisation to his notion of random 
convolution. Here, (j8]) shows that the sampling mechanism of the RD can be viewed as a random filtering 
operation applied to continuous-time signals. We note, however, that the filtering operation in ([5]) is not 
a convolution because of the time-varying nature of h(t,r). Strictly speaking then ((SJ is distinct from 
the systems proposed in |13j and [5], although random filtering remains a common thread. 

Frequency domain description. An equivalent frequency domain expression to (JSJ) can be derived 
(see Appendix 16.11) that relates the discrete Fourier transform (DFT) of y{k), denoted by Y(n), to the 
FS coefficients X(n), 

, \ 

n+- 

Y{n)=T ^2 P{m)e~ j ^ n sinc{^n)X(n-m), (9) 

m=n— "2" + l 

n = 0, . . . , M — 1, where sinc(x) = sin (x) /x, x G R. This equation clearly shows the frequency domain 
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convolution caused by the multiplication with p(t) and the effect of filtering with an ideal integrator 
(indicated by the presence of the e _J ^ n sinc( -^n) term). Thus, one can also interpret Y(n) as the 
output of a random, frequency-varying filter with impulse response H(n,m) = P(m)e~ :> T"sinc(-p>n). 
We see therefore that the RD's output in either the time or frequency domains can be viewed as the 
output of a random filter or convolution. 

2.2 Sampling with the modulated wideband converter 

We now let x(t) be a bandlimited, finite energy signal. The spectral content of x(t) on R is then 
appropriately given by its Fourier transform (FT) X(ui), 

p oo 

X{uj)= / x{t)e? ui dt. 



Here, x(t) is bandlimited in the usual sense, i.e., X(w) is assumed to be bounded: X{uj) — for 
| a; | > irW radians per second, W £ M + , where ttW is the bandwidth of x(t) and 2wW is the Nyquist 
frequency in radians per second. We adopt the following definition from [23] . The class of multiband 
signals B(J-, W) is the set of bandlimited, finite energy signals whose spectral support is a finite union 
of bounded intervals, 

B(T, W') = {x(t) £ L 2 (R) : X{u) = 0,lu£'F} (10) 

where 

K 

J 7 ={J[a i ,bi), \ ai \,\bi\ <ttW'. (11) 

i=l 

In the following description of the MWC, primes are added to the parameters to distinguish them 
from the parameters of the RD. The same letters are, however, used for similar quantities. For example, 
W/2 denotes the bound on the harmonics of multitone signals while W'/2 denotes the bandwidth of the 
multiband signals. 

Like the RD, the zth channel of the MWC multiplies x(t) by a random signal Pi(t), then filters and 



samples the product x(t)pi(t) at a sub- Nyquist rate (see Figure 1(b) I. As in the original formulation, we 
assume each channel's filter is an ideal low pass filter, although it has been shown that the MWC can 
operate with non-ideal low pass filters |17) . Here, we examine its original formulation to make a clearer 
comparison to the RD. The signals Piit), i = 0, . . . , q' — 1, are periodic extensions of different realisations 
of the continuous random process used for the RD. Formally, let Z = {Z{\, be a sequence of independent 
and identically distributed Bernoulli random variables taking values ±1 with equal probability, and for 
some positive integer L' , let p(t; Z) denote the random process 

p(t;Z) = Zi, t£ | = o,. . (12) 



-W w 

Let Zi denotes a particular realisation of Z. The signals Pi(t) are then periodic extensions of the 
realisations p(t; Zi) of p(t; Z): 

Pl {t + mT p ) =p(t;Zi), forte[0,T p ], raeZ, (13) 
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where T p — -^r?. The impulse response of the ideal low pass analogue filter is h(t) — E ^-smc( :!L ^-t), 
implying a cut-off frequency of 2 ^^f radians per second. (This is a slightly different assumption than 
that made in [TT] where the cut-off frequency was set to W'/M'.) Each channel samples at a rate that 
is M' £ K + times slower than the Nyquist rate, i.e. ^- = jp- Hz, where T s is the channels' sampling 
period. The system's average sampling rate is q'W /M' Hz. In [TT], Mishali and Eldar showed that a 
necessary condition for successful reconstruction is M' < L' or that T s <T p . We assume this condition 
holds for the MWC throughout the paper. We also assume that the average rate is always less than the 
Nyquist rate (q' < M'). 



Time domain description. By inspection of Figure 1(b) we obtain the following time-domain 
expression for a single channel of the MWC: 



W(fc)= fli (fcfr) 
irW r °° 



-/ x(r) Pi (r) S mc(^ (t-r))dT 

J — OO 



w 



(14) 



for all k € Z. This expression corresponds to the time domain expression in ([5]) for the RD. Like 
the RD, the samples yi{k) can be interpreted as inner products with a set of sampling functions 
{^p l (r)sinc(^fc - ^r)} where 

lfc(k) = (x(t), ^(r)sinc(^fc - zg-r)). (15) 

But in contrast to the RD, these sampling functions have finite frequency support and infinite temporal 
support. In the time-frequency plane, their support partitions the space into horizontal strips of width 
W /L' Hz (see Figure [2] right panel). This particular set of sampling functions represents one instance 
of a general theory put forth by Eldar [7] to compressively sample continuous-time signals from unions of 
shift-invariant spaces, of which multiband signals are members. The theory combines modern sampling 
theory with CS theory in such a way that samples are acquired in a typical manner by projecting the 
signal onto a set of sampling functions (as in (|15p ). but CS theory is needed for reconstruction. We do 
not review the details of this theory here because it does not apply to the RD. 

Interpreting (| X4[) as a random filtering, we identify the time-varying impulse response as 



hi(t,r) = :!L ^-pi(T)sinc( IL ^-(t — r)). Because the MWC employs an ideal low pass filter, the impulse 
response contains a sine function instead of a rectangular function as seen in @. Consequently, the 
impulse response has infinite temporal extent in this (ideal) setting. For the MWC, hi(t,r) is random 
in the same general sense as the RD's time- varying impulse response — the sine function is multiplied by 
a realisation of a random process. 

Frequency domain description. Using standard Fourier analysis techniques, Mishali and El- 
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dar [11] derived the following frequency domain description for the i th channel of the MWC, 



, Li(i'+DJ 

= M7 E P i (m)X(o;-m^)rect(^ 7 a;) (16) 
m=-L|(i'+l)J+l 

Li(i'+l)J L'-l 

£ /3( m )X(a,-m^)rect(^) ^T^e"^" 1 ', (17) 
m =_L|(i'+i)J+i i=o 

where Pi(m) denotes the FS coefficients of Pi(t), |_-J denotes the floor rounding operation, pu — Pi(t) for 
t e [l/W, (I + 1)/W) and 

(W' l-e-i& m 
/3(m) = I M> j2Trm r . 

[l/L', m = 

Appendix 16.21 contains a slightly different derivation of (fT7|) than that presented in [11]. Compar- 
ing (fTB")) to ©, we observe that the DTFT of the output sequences yi(k) can again be interpreted 
as the result of a random convolution, where the frequency-varying impulse response is given by 
Hi(m,L)) = Pi(m)rect[-^yjuj) . The spectral content of the samples Di(k) is expressed by the DTFT, 
as opposed to the DFT, because x(t) is defined on the entire real line for the MWC instead of on an 
interval. We also note that the scalars j3(m) are the complex conjugates of a(n) in ([5]). 

Single channel MWC. There are two ways to collapse the MWC into an equivalent single channel 
system. One can either lengthen the observation interval by a factor of q' (keeping all other parameters 
fixed) , or one can consider increasing the sampling rate while maintaining the same observation interval. 
If the observation interval is lengthened, the sequence of samples from a single channel MWC can be 
partitioned into q' groups of W'T/M' , where each group of samples is thought of as the output from an 
individual channel in the multi-channel configuration. Alternatively, one can set the sampling rate of a 
single channel MWC equal to the average rate of a multi-channel MWC, i.e., set the sampling rate to 
q'W /M' Hz, and accordingly adjust the low pass filter's cut-off frequency to q'W'/L' Hz (see Figure[3]). 
Notice that in this case we still maintain the requirement M' < L' . The frequency domain description 
of this single channel MWC can now be obtained from (fl6|) by substituting M'/q' for M' and L'q' for 
L'\ 

L' 



r( e jw ^)rect(^ 7 u;) 

L' \ \rf 2-KW'q' 

—t- /. i 1 X I i.i — m 3— 



n'W L ^ (L ' /9 ' +1)J , , (18) 

q ~^T E P(m)rect(^)X(o;-m^V, 



=-Lf(i<7g'+i)J+i 

Summary. Equations ©, ([HI), and (flB")) all indicate that the sampling mechanisms for the RD 
and the MWC are based on analogue random filtering/convolution. However, the RD's integrator and 
the MWC's low pass filter induce significant differences in the specific form of the random convolutions, 
or equivalently, in their sampling functions. In fact, the different filters induce bipolar time-frequency 
characterisations that make them well-suited for the signal models they target — an ideal integrator with 
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Figure 3: Block diagram of a single channel MWC. To be equivalent to the multi-channel system depicted in 



Figure 1(b) this system samples at a rate q' times faster and has a low pass filter with a cut-off frequency q' 



times greater. Additional digital processing is also required to form the linear system in ((20 



a finite impulse response is well-suited to signals modeled on a finite interval and an ideal low pass filter 
with a finite frequency response is well-suited to signals modeled on a finite frequency band. 



3 Sparse signal models and system characteristics 

This section introduces the notion of signal sparsity and briefly discusses signal reconstruction for the 
RD and the MWC. We show that in practice the MWC can at best recover an approximation of the input 
multiband signal instead of perfectly reconstructing it. This important fact, although not surprising, 
is completely missing from the current literature. We also discuss three system characteristics related 
to sparsity that ultimately stem from their reliance on CS theory and algorithms. The first is a model 
sensitivity of the RD that is already a familiar limitation 10,24,25 . The second concerns a possible 
sensitivity of the MWC with respect to the number of channels, and while again this behaviour may 
not be surprising, it is nevertheless important in the design of the MWC. The third characteristic, in 
contrast to the second, concerns the robustness of the MWC to increases in the width of the spectral 
bands caused by windowing. The MWC results are new. 

3.1 Sparse multitone signals and RD signal reconstruction 

In the original formulation of the RD, the input signal was not only modeled as a multitone signal on 
the observation interval, but was also assumed to be spectrally sparse |10j . A spectrally sparse multitone 
signal is a multitone signal that has a small number of nonzero FS coefficients out of the N + 1 possible 
(refer to ([1])). More precisely, letting K denote the number of nonzero coefficients (or equivalently the 
number of nonzero frequencies), a spectrally sparse multitone signal is one that satisfies K <C N . 

The reconstruction of a sparse multitone signal x(t) from the samples y(k) hinges on the matrix form 
of @, 

y = £*X (19) 

where S is a M x N matrix of the form 




Pn-i 



Po 





P&-1 







P(M-l)- 
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and 



y=[y(0),...,y(M-l)]' 




f)X(-f ),..., 



«(f -l)X(f -1)]' 



a(n r ) = T 



e 3 N 



1 



a(0) = 1/W 



j2irn. 



■r 



for n r = —N/2 + r, r = 0, . . . , N — 1, and I = 0, . . . , N — 1, where the apostrophes denote transpose. 

By construction, (fT§|) is an underdetermined linear system of equations (SSI* is M x iV with M < A^; 
see Section l2~Tj) and underdetermined systems do not, in general, have unique solutions. Nevertheless, 
CS theory has shown that with the presumed sparsity of X, ([K))) can be solved by a direct application 
of a number of recently developed recovery algorithms, e.g., t\ minimisation 0], orthogonal matching 
pursuit [S], or iterative hard thresholding [2l) ( l27 j . In the CS literature, solving (fTU|) is termed the single 
measurement vector (SMV) problem. Theoretical guarantees regarding the successful recovery of X are 
provided in [TU] in terms of the degree of sparsity and the number of samples (measurements) collected. 

3.2 Sparse multiband signals and MWC signal reconstruction 

Recall that multiband signals are bandlimited, finite energy signals whose spectral support J 7 is a union 
of bounded intervals (see (|10p and (111 j) V A sparse multiband signal is a multiband signal whose support 
has Lebesgue measure that is small relative to the overall signal bandwidth, i.e., X(J-) <C W [35]. If, 
for instance, all the occupied bands (intervals) have equal bandwidth B Hz and the signal is composed 
of K disjoint frequency bands, then a sparse multiband signal is one satisfying KB <C W. In the CS 
literature, signals having this type of "block" structure have been studied in various settings; the central 
question being whether this additional signal structure reduces the minimum number of samples required 
to reconstruct the original signal (see e.g., [5|l2^l5Uj). 

MWC signal reconstruction centres on the matrix form of (fT6)) . 



Y(w) = **S(w) 



(20) 



where 



Y(«) = [y (w),...,iv_i(w)]' 

Y i (w) = Y < (^)rect(^ W ) 



= Pii 




S(w) = [S (u;),...,S L ,- 1 (u J )}' 
S r (u>) = (3(m r ) rect(^w) X(uj - m r ^f^) 



W'l- e- 3 Tr m - 
/3(m r ) — — y2irm r ' *°> = ^ 
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for % = 0,...,q' - 1, I = 0,...,L' - 1, r = 0,...,Z' - 1 and m r = -\_\{V + 1)J + 1 + r. Like (fT9]l . 
this linear system of equations is underdetermined since we assume q' < M' < L' (see Section 12. 2p . 
The vector S(ui) is sparse in the sense that most of its elements (segments of X(u)) do not contain the 
occupied frequency bands that comprise x(t). Equation (|20p can also be derived from the single channel 
MWC, although one has to first extract q' lower rate sample sequences from the single higher rate output 
sequence. We refer the reader to [TT] for details regarding the extra processing steps. 

In practice the linear system in (|20p cannot in general be computed because it theoretically requires 
an infinite amount of data. To see the point, consider the inverse DTFT of (|2U|) . It immediately follows 
that the inverse DTFTs of the spectra Y^uj) = y^'" w^)rect( J^-w) are the time domain sequences 
{yi(k), k e Z}i. If one interprets the spectral segments S r (ui) — /3 r rect(^ 77 oj) X(lu — rrir 2 ^ ) on the 
right hand side of (j2"U|) as single periods of periodic spectra, it follows that their inverse DTFT are the 
sequences {-f r (k), k e Z} r where 

ji pTiW'/L' , 

>^ = / S r (u)e^ k " du>, (21) 

and 

oo L , 

S r (u,)= £ lr (k)e-^^, u,e[-^,^]. (22) 

k— — oo 

The DTFT transform pair of (|20p is therefore the linear system, 

Y = **S (23) 

where $ and ^ are as in (|20j) but Y and S are now infinite column matrices: the rows of Y are the 
sequences yi{k) 1 k £ Z, and the rows of S are the sequences 7r(fc), k G Z. The matrix S is described as 
being jointly sparse because most of its rows are zero since the zero- valued elements of S(cj) correspond 
to zero- valued sequences 7 r (fc) (rows of S). In general, a matrix Z is said to be K joint sparse if there 
are at most K rows in Z that contain nonzero elements. The recovery of S from the measurements Y 
in (|23[) is called an infinite measurement vector (IMV) problem [71 lll[l3Tj because the columns of Y are 
viewed as CS measurements (via the measurement matrix $'4') of a collection of vectors that share a 
common sparse support. 

The limitation of only ever collecting a finite number of samples truncates the rows of Y and causes 
the IMV problem in (l2"3l to become a so-called multiple measurement vector (MMV) problem [3"Tll36| . 
where the goal is to recover a finite number of the columns of the jointly sparse matrix S corresponding 
to the finite number of acquired samples. Using existing CS methods, this MMV problem can be solved 
exactly, or with exceedingly high probability, provided the matrix satisfies certain conditions and 
that enough samples are collected relative to the joint sparsity of S. The solution, however, is in general 
a linear approximation to the true spectral slices Yi (w) because the solution only recovers a finite number 
of coefficients "f r (k) in (|22j) (see |37) for information about linear approximations). This fact is in contrast 
to the sparse multitone signal model that is parameterised by a finite number of parameters and thus 
only requires a finite number of samples for perfect signal reconstruction. 



13 



Reconciling CS Sampling 

In pj] and [31] , Mishali and Eldar proposed a two step reconstruction process termed the "continuous- 
to-finite block" that provably recovers x(t) exactly given an infinite amount of data, or in other words, 
recovers x(t) to an arbitrary precision given sufficient data. The first step recovers the joint support of 
S by solving an associated MMV problem, and the second step uses the recovered support to reduce 
the dimension of the measurement matrix such that a unique least squares solution can be found. 
We stress that even if this two step process perfectly solves the MMV problem derived from ([23"]) . the 
solution can only, in general, approximate x(t) for a finite number of samples. 

3.3 RD sensitivity to basis mismatch 

The ability of CS recovery algorithms to recover the FS coefficients in ([T9"]) depends fundamentally on 
the sparsity of X, or equivalently, on whether x(t) has a sparse FS representation in the observation 
window [0, T] and on the number of acquired measurements. For the RD, the sparsity level of X (number 
of nonzero entries) in (|19p not only changes with the number of tones comprising x(t), but can also be 
increased if there is a mismatch between the Fourier basis in which x(t) is actually sparse and the basis 
in which x(t) is modeled. In fact it is known that x(t) may have a sparse Fourier expansion using 
{exp (j^-nt)} n but may not have a sparse expansion using {exp (j YTgni)} n > where 8 G K is some small 
perturbation |37l p. 379-380]. The implication for the RD is that in a blind sensing scenario, where 
the frequencies of the tones are not known, model mismatches are likely and if the sparsity level of X 
rises above a required level, CS recovery may be jeopardized. This possibility has been described as a 
sensitivity of the RD because a small basis mismatch (small S) can lead to significant reconstruction 
errors (23]. This sensitivity was acknowledged by Tropp et al. in [10] . highlighted in [If}] HO] and 
studied in [3_D[5S]. Recent results, however, could potentially alleviate the problem. In [31], Candes 
et al. showed that sparse multitone signals, defined in terms of an oversampled DFT dictionary, can 
be effectively recovered from undersampled data. Because this model can sparsely represent multitone 
signals on [0, T] as well as on [0, T + 5], these results suggest that a modified RD could be robust to basis 
mismatch. Similarly, Duarte and Baraniuk |25] proposed a heuristic solution that marries model-based 
CS [6], redundant DFT frames, and standard spectral estimation techniques. 

3.4 Potential MWC sensitivity to the number of channels 

Because of its reliance on CS, the MWC reconstruction algorithm inherits the CS conditions of successful 
reconstruction. One such condition relates the number of CS measurements per unit time (number of 
rows of Y in (|23p ) to the support of S. Assuming that the matrix <fr\I r has maximal rank q', a necessary 
condition for unique recovery of S from the measurements is q' > 2|supp S|, where |supp S| denotes joint 
sparsity of S [TT][35]. Knowing that q' also equals the number of channels in the MWC, it is reasonable 
to want to minimise q 1 so as to minimise the processing and hardware complexity of the MWC, i.e. set 
it as close to 2|suppS| as possible while still ensuring successful recovery. (In practice, this lower bound 
changes depending on the specific CS algorithm employed.) With this in mind, it is natural to ask how 
MWC's performance behaves around this condition boundary. Clearly, beyond the theoretical limit of 
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Figure 4: Reconstruction error as a function of the number of channels q' for the MWC. Q denotes the joint 
sparsity |suppS| of S and S-OMP is used to recover the input signal's support with T = 4 sec, W' — 500 Hz, 
L' — 50, M' — 20. The plots show the rapid increase in error as q' falls below a critical point. For the cases 
shown, the error increases exponentially as q' decreases. Left: active bands have equal amplitude; Right: unequal 
amplitudes. 



2|suppS|, it is impossible to guarantee recovery of the signal's support. Consequently, one expects the 
performance to degrade beyond this point. Figures |4] and [5] show that performance (as measured as 
squared error) can indeed decay rapidly, in fact expontially fast, as a function of the number of channels 
q' . In the example, we consider a sparse multiband signal bandlimited to 500 Hz that is sampled by a 
MWC with a spectral resolution of 20 Hz (L' = 50) and a channel sampling rate of 50 Hz (M ' = 20) for 
various sparsity levels and values of q 1 . The average squared error is computed as ||x — where 
x is a digitally simulated sparse multiband signal, x is the reconstructed signal, and ||-||| denotes the 
£2 norm. Figure U] shows the results when the simultaneously orthogonal matching pursuit algorithm 
(S-OMP) is used to recover the signal's support. For each value of q 1 , we report an error that is averaged 
over 100 trials with each trial using a different randomly generated <I> matrix. Note that when each band 
has equal amplitude (left panel) degradation begins at roughly q' = 2|supp S| log (L'), which is consistent 
with S-OMP [33]. When the band amplitudes are randomly chosen (right panel) the turning points are 
roughly proportional to 2|suppS| log (L 1 ). Figure [5] shows the original and reconstructed signals for one 
specific case. 

The point of this example is to show that when the number of channels is near its theoretical limit, 
a small change in its value can lead to dramatic performance decreases for the MWC. The exact degree 
of degradation is problem dependent however. 

3.5 MWC robustness to windowing 

Let x(t) be a sparse multiband signal with FT X(lo) and let z{t) be a windowed version of x(t), 

z(t) = x(t)w(t), 

where w(t) is an indicator function of some sub- interval of the observation interval. Consider the situation 
where z(t) is the input signal to the MWC, but where the MWC is designed for a multiband signal, i.e., as 
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Figure 5: Results of two specific cases from the experiment in Figure [4] (Q = 3, equal amplitudes). The time 
and frequency domains of the input multiband signals are shown along with the difference signal between the 
input and reconstructed signals (green raised plots). Left: q' = 25, squared error= 3 x 10 -5 ; Right: q' — 18, 
squared error= 0.001. 
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Figure 6: Reconstruction error as a function of signal duration for the MWC. A four band multiband signal 
bandlimited to 500 Hz was systematically windowed and sampled by a MWC (q = 20, 50, L' = 50, M' = 20) 
over an observation interval of 2 sec. Each point represents a squared error averaged over 50 trials. Note that 
the error remains relatively constant over a wide range of signal durations. 



described in Section l2~2l Because time limited signals cannot be bandlimited, this case represents a model 
mismatch scenario where essentially the input signal has shorter duration than expected. The spectrum 
of z{t) equals X(u>) convolved with a sine function, thus the windowing spreads the original spectrum 
X(uj) 39J . One might therefore expect that if the spectrum were sufficiently spread, the sparsity of S(lj) 
would increase to a point where the condition q' > 2|suppS| is violated and performance would degrade 
rapidly as seen above in Figure 01 However, experimental results show that this is not necessarily the 
case. Figure [5] shows that the MWC can be robust to a wide range of signal durations. In fact, for the two 
values of q' shown, the error remains relatively constant for signal durations ranging from 100% to 5% 
of the observation interval. Observe that for q' = 20, reconstruction does eventually break down (signal 
duration less than 0.4 sec), but these signals represent, in some sense, those that maximally mismatch 
the multiband model because of their small compact support. Figure [7| shows time and frequency plots, 
along with the reconstruction errors, for two specific examples for long and short duration signals. 



16 



Reconciling CS Sampling 




100 HU*A - - 



^ — -Ml 



ii I ll I it 



Figure 7: Results of two specific cases from the MWC experiment in Figure|H](g' = 50). The time and frequency 
plots of two windowed multiband signals are show. They represent the two extreme cases: full duration (2 sec), 
short duration (0.1 sec). The raised plots (green) are the difference signals between the original and reconstructed 
signals. Note that the spread of the spectrum has little effect on reconstruction. 



4 Sampling continuous- time block-sparse signals 

This section ends with a description of a novel sampling system whose inspiration comes from a simple 
exercise: we exchange the signal models for the RD and the MWC and analyze the relationship between 
the input and output signals. In particular, we consider sampling sparse multitone signals with the 
MWC and sparse multiband signals with the RD. We discover that the MWC uses what we call "block- 
convolution" to successfully sample and recover block sparse signals. We also discover that the RD does 
not use block convolution and thus cannot, without difficulty, sample multiband signals. In Section [4.3[ 
we apply this concept in a new way and propose a system that can sample and recover continuous-time 
block-sparse signals at low sampling rates. 



4.1 MWC with sparse multitone inputs 

Consider the problem of using a MWC to sample and recover a sparse multitone signal instead of 
sparse multiband signal. Let x(t) be a sparse multitone signal on the observation interval [0, T] with 
bounded harmonics — ^ < S < and denote by X(n) the FS coefficients of x(t) on [0,T] (cf. (|TJ) ) . 
Let the random waveforms Pi(t),i — 0, . . . , q' — 1, be as defined in Section 12. 2[ but with W replaced 
by W; their common period T p thus equaling U?, L' e Z + . Let the ideal low pass filter have impulse 
response h(t) = ^^sinc(^T^i) and let the sampling period be T s = For this problem and for this 
section only, we assume that the duration of the observation interval is an integer multiple of T pi i.e., 
T = Njjy, N E 1j + . We also assume, for ease of exposition, that the period of Pi(t) equals the sampling 
period (T p — T s or that L' = M'). These assumptions are not necessary, but help make this section's 
message clear and easier to understand. Lastly, to make the problem meaningful, we assume T >T S . 
In Appendix 16.31 we derive the following expression relating the FS coefficients of x(t) to the DFT 
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coefficients of the output samples, 

L'-l 

Yi(n) = Vim) X(n - TVm)rect(^n) ^ p u e^ lm , (24) 

m= _L!(z/+i)j+i ;=o 

where p u = Pl (t) for t 6 [l/W, (I + 1)/W) and 

1 1 - e-JT7 m 



m ^ 

r)(m) = { N j27rm 

1/L' m = 

for —N/2 < n < N/2 and i — 0, ... ,q' — 1. This expression is analogous to the frequency domain 
description of the RD given by (|9|) and is what (fl~6| becomes assuming a sparse multitone signal model 
and L' — M'. In matrix form, (PMl) becomes the MMV problem, 



Y = *tf S, (25) 



where 



= Pil 

.2tt, 

Sr,t> = rj(m r )X(n v — A^m r )rect(-^-n l) ) 
1 1 — p~ J Tj m r 
N j2mn r 

for i = 0, . . . , q' - 1, I = 0, . . . , V - 1, v = 0, . . . , N - 1, ^ = - [f J + i>, r = 0, . . . , L' - 1 and 
m r = — \_\{L' + 1)J + 1 + r. In contrast to sampling a sparse multiband signal, this MWC MMV problem 
does not result from truncation, rather its finiteness derives from the fact that multitone signals are 
finitely parameterised. The Fourier components of x(t) can be recovered by solving (|25l) using several 
existing CS algorithms including greedy algorithms [33], mixed norm approaches [H], MUSIC based 
recovery algorithms and, in particular, the approach proposed by Mishali and Eldar in [llj and |31) . 
One can thus successfully sample and recover sparse multitone signals with the MWC. This fact is 
somewhat surprising since the MWC is designed to sample multiband signals, not multitone signals. 

More importantly, this example highlights a property of the MWC that differentiates it from the 
RD and allows it to successfully sample and recover multitone signals, as well as multiband signals. 
Specifically, the convolution in (|24|) involves shifts of X(n) by integer multiplies of N that, when greater 
than one, yields a "block convolution". By block-convolution, we mean every DFT coefficient Yi(n) 
in (1241) is a linear combination of finite segments of X(n). Block-convolution is also seen in (|16[) where 
the FT of a multiband signal is shifted by integer multiplies of In contrast, it is not seen in ^ where 
the frequency shifts describing the RD are by one. This aspect of the MWC allows the construction of 
a linear system like ([2"U|) and (|2~5|) that describe the original spectrum in terms of linear combinations of 
these blocks. The blocks themselves represent a partition of the frequency axis that effectively discretises 
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a signal's spectrum. In Section 14.31 below, we incorporate this property into a mult i- channel random 
convolution system that samples and approximately recovers continuous-time block-sparse signals, the 
time domain analogue of sparse multiband signals. 

When the observation interval, the period of Pi(t), and the sampling period are equal, i.e. when 
N = 1 and L' — M', (|25|) collapses to a SMV problem, where the matrices Y and S become the 
vectors [Yo(0), ... ,l^'-i(0)] and [rj(mo)X(— mo), . . . , r)(mL'-i)X(— m£/_i)] respectively. Note that in 
this special case, the MWC and the RD produce equivalent SMV problems, the only difference being 
the timing in how the samples are acquired — the MWC collects a measurement vector in parallel (each 
channel samples once) while the RD collects its samples sequentially. 

4.2 RD with sparse multiband inputs 

When a multiband signal is the assumed signal model for the RD, the system fails to produce a single 
measurement vector problem whose solution recovers x(t). To be more concrete, let x(t) be a sparse 
multiband signal bandlimited to W'/2 Hz with a fixed support T. Consider a RD parameterised by 
M that samples x(t) on [0, T] and let p(t) be as described in Section |2~T1 A similar analysis to that 
contained in Appendix 16.11 leads to the expression 



where here N' represents the number of Nyquist periods within the observation window (N' — TW'). 
Here, for simplicity, we assume N' is a positive integer. This expression relates the time domain output 
samples y(k) to the Fourier spectrum of x(t). To construct a finite dimensional linear system consistent 
with the RD formulation, one could approximate the integral by discretizing w: 



where <5 W = 2z^L for some positive integer D and uji — —nW + 6 u (i + 1/2). One can then see that 
in comparison to © (and also (|3"3"f in Appendix 16. ip . this expression describes a SMV problem whose 
dimensions grow linearly with D. Clearly, in cases where one wants to closely approximate the integral 
or finely discretize the u> axis, the size of the SMV problem could become computationally unwieldy. In 
fact, examples from [20] show that naively modeling a multiband signal as a multitone signal can lead 
to computationally prohibitive or even intractible CS problems given current technology. In comparison 
to the MWC, one reason for this difficulty is that the RD does not use block- convolution. 




N'/M-x 
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4.3 Continuous-time block-sparse sampler 

The class of continuous-time block signals Q{T,to) is the set of continuous-time, real-valued, finite energy 
signals whose support is a finite union of bounded intervals, 

G(T, t ) = {x(t) g i 2 ([0, t ]) n C([0, t }) : x(t) =0,t£T} 

where C([0,£o]) denotes the set of continuous functions on [0,io] an d 

K 

T=\J i a ti h), < a i} bi < t < oo. 

i=i 

A continuous-time block-sparse signal is a continuous-time block signal whose support has Lebesgue 
measure that is small relative to the signal's overall duration, i.e., A (T) <C to- 

The proposed system combines block convolution with the MWC architecture and the random con- 
volution ideas of Romberg to obtain a sampling system for continuous-time block-sparse signals (see 



Figure 8(a) ). The resulting system can also be interpreted as the time domain analogue of the MWC. 

Matusiak and Eldar [40] recently proposed a sampling system targeting a nearly identical signal class. 
Their system also shares structural similarities to the MWC but leverages Gabor frames instead of block 
convolution. 

Sampling System. Let x(t) be a continuous block-sparse signal on the interval [0, T) and let Z be a 
Bernoulli random variable taking values ±1 with equal probability. Denote by {pi(l), I = 0, . . . , L— l}fT 
an ensemble of random vectors drawn from Z (L,q G Z + ). The sampling system has q channels and 
operates in parallel like the MWC: the i th channel convolves x(t) with Pi(l), resulting in the continuous- 
time signal 

L-1 

M*) = Ef*W a; (*- i r). ( 26 ) 

and then uniformly samples gi{t). Note that by construction the convolution in (|26[) is a block-convolution 
(see Section |4.1[) and is the standard filtering operation underlying waveform synthesis [39 > p. 123]. 
Restricting the time axis to the interval [0,T/L], ([26]) becomes 

L-1 

^(i)rect(f = 5>i(0z(i-/£)rect(f (<-£)), (27) 

1=0 

for i = 0, . . . , q — 1. This system of equations is similar to the frequency domain description of the 
MWC (equation (JTSJ) ) in that the unknowns are segments of the continuous signal of interest; here 
we are interested in recovering \x{t — ^)rect(^(t — <§;))}, whereas for the MWC we want to recover 
{X(u— m 27T ™ )rect(^ 77 iu;)} m . Consequently, given the block sparsity of x(t), one could proceed to solve 
the linear system in a manner similar to that proposed by Mishali and Eldar for the MWC. Alternatively, 
one could simply discretize the time axis, form a MMV problem from (f2"T|) and solve for samples of the 
segments {x(t — Z^)rect(^(t — jj;))},- (This type of approach was described in Section l4.2l ) Either way, 
the solution that is obtained would be an approximation of the original block-sparse time domain signal: 
the MWC reconstruction method would yield a linear approximation (see Section l3~2"j) . and discretizing 
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the time axis would produce samples between which one would have to interpolate. Below, we present 
another method to compute a linear approximation that, unlike the Mishali and Eldar method, uses 
CS techniques to directly retrieve both the support of x(t) (at resolution L) and the coefficients of the 
approximation simultaneously. 

Representing the segments {x(t — I j;)rect (%jf(t — j L -))} l in an appropriate orthogonal basis, ([27]) 
may be written as 

L—l oo 

g .(t)tect(%(t - &)) = £)pi(0£aii(»M.(t), (28) 

1=0 n=0 

where a/(n) = (x(t — i?)rect(l^(i — ^)) ,^ n {t)j ■ Sampling at a rate Hz over the interval [0, jj] 
yields 

y 4 (fc) = g l {k^ M ) 

L — l oo 

= Z)ft(0Z)ai(n)^»(*w). (29) 
;=o n=0 

for fc = 0, . . . , M — 1 and i = 0, . . . q — 1. Thus to recover a Z)-term linear approximation, one needs to 
solve for the coefficient matrix A in the matrix form of ([29)) . 

Y = (30) 

where 





= Vi(k) 


= 


= Pi(l) 




= OLi(n) 


*n,fe 


= ipn(k 



LM ' 

for i = 0, . . . , q — 1, k = 0, . . . , M — 1, / = 0, . . . , L — 1, and n = 0, . . . , D — 1. Given the samples {yi{k)} 
and the matrices $ and one can form an MMV problem by post multiplying both sides of ([3"U|) by 
the right-inverse of when it exists. This yields 

Y*t = *A (31) 

where \&t denotes the right-inverse of The right-inverse exists if and only if the columns of & span 
M D [H], which is only possible if D < M, or equivalently, when the approximation order is less than 
or equal to the number of acquired samples. Thus, assuming exists, the maximum approximation 
order D that can be recovered with this scheme equals the number of samples M acquired per channel. 
Conversely, the minimum number of samples needed to successfully recover a D-order approximation 
is M. However, the desire to recover an approximation that is as accurate as possible while using a 
minimum number of samples suggests that setting D = M is an optimal choice. Thus, in what follows, 
we assume ^ is a square matrix that has an inverse ^ . The linear system in (|3ip therefore becomes 

Y*- 1 = *A. (32) 
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From a CS perspective, 3? is a q x L Bernoulli sensing matrix where q < L, and the elements of the 
product Y* -1 are the CS measurements. In this case, and in contrast to the RD and the MWC, these 
measurements are not simply the samples acquired by the system (c.f. equations (1191) and (|23[1 ). but are 
linear combinations of the samples. The coefficient matrix A can be solved for using any existing MMV 
CS solver, see e.g. |31H36j . The conditions defining when a unique solution exists has been extensively 
studied and depends on the properties of <fr and the degree to which A is sparse (again, see [3TH55] ). 
Note that because x(t) is assumed to be block-sparse, it follows that A is a joint sparse matrix. 

Letting A; >n = S;(n) denote the entries of the CS solution of (j3"2"j) . x(t) can be approximated (recon- 
structed) by computing an approximation for each segment {x(t — /-^)rect(^(i — jj;)) },, I = 0, . . . , L — l, 

D-l 

x(t ~ l£)rect(^(t m(n)Mt) 

and then concatenating the L segment approximations. 

Example. Consider the continuous-time block sparse signal depicted in the top panel of Figure [8(b) | 
Samples of the filtered signals gi(t) (middle panel) are acquired by an 8 channel system (q — 8) with 
a sampling rate of 400 Hz (M — 40,LM/T = 400) where the time axis is partitioned into 10 segments 
(L = 10). Using a Fourier basis, ip n (t) — e T l h , the linear system ([30]) was solved using S-OMP [33] 
resulting in L linear approximations (D — 40) of the segments {x(t — Z^)rect(l^(t — ]§;))},■ The 
reconstructed signal is shown in the bottom panel of Figure [8(b)] In this case, the reconstructed signal 
is a faithful representation of x{t) (normalized squared error = 0.055). Clearly, the quality of the 
reconstruction depends on the accuracy of the approximations and on how well a particular CS algorithm 
solves Q32[) . For a given application, some bases will be more appropriate than others. For example, if 
x(t) contains discontinuities, a wavelet basis would likely provide a better approximation than a Fourier 
basis. 

Strictly speaking continuous-time block sparse signals are not bandlimited, but if one examines the 
spectrum of the test signal in Figure [8(b) [ one would discover that the signal is "essentially" bandlimited 
to about 1000 Hz. Thus, it could be argued according to the Shannon-Nyquist sampling theorem that 
a sampling rate of about 2000 Hz would be required to accurately capture this signal. Relative to 2000 
Hz, 400 Hz represents a five fold reduction in sampling rate. 

5 Conclusion 

In this paper, we showed that the sampling mechanisms of the RD and the MWC can both be thought 
of as being based on the underlying concept of random filtering or random convolution. The most 
substantial difference between the systems stems from the specific form of their sampling functions (or 
random filters) and from the assumed signal models. The RD has sampling functions that have finite 
temporal extent but infinite spectral support; the MWC employs sampling functions that have finite 
spectral support but infinite temporal support. The randomness in the sampling functions is a hallmark 
of CS theory that is fundamental in guaranteeing unique solutions to the underdetermined linear systems 
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Figure 8: (a) Schematic diagram of a continuous-time block-sparse sampling system. The system is a general- 
ization of random convolution as originally proposed in [9]. Each channel convolves x(t) with a random sequence 
and then samples the result at a low rate, (b) Top panel: Simulated block sparse time domain signal on the unit 
interval. The signal is a modified version of the "bumps" test signal from the WaveLab toolbox [32] ■ Middle 
panel: Overlay plot of the signals Qi(t) resulting from the random convolution. Here, x(t) was sampled with a 
8 channel system. Bottom panel: Reconstructed signal approximation. Each 0.1 second duration segment is a 
linear approximation (Fourier basis) of x(t) on that segment (normalized squared error= 0.055). 



that characterise the RD and the MWC. 

Block convolution is also an important property that differentiates the MWC from the RD because 
it is one approach that effectively processes infinite dimensional signals that have a block structure. 
The absence of this property is one primary reason the RD cannot, in general, reconstruct multiband 
signals. We incorporated block convolution into a new sampling system that samples continuous-time 
block-sparse signals. 

We also offered two novel insights into how the MWC behaves with regard to the underlying sparsity 
assumption. We showed that if the number of channels is near the minimum required, relatively small 
changes in the number of channels (or equivalently if the sparsity changes for a given number of channels) 
can cause significant reconstruction errors. On the other hand, we provided evidence that the MWC is 
robust to increases in the width of the spectral bands caused by windowing. 

From this paper's perspective, one can begin to consider generalisations to the RD and the MWC 
that target different signal classes, in particular, CS sampling systems that have different time-frequency 
characterisations. For example, a system that "compressively" samples radar pulses and chirps in an 
efficient time-frequency manner could possibly offer a means to effectively detect and classify these 
signals while avoiding the overhead of sampling several bands simultaneously or reconstructing the 
Nyquist equivalent signal. This paper takes a step towards this goal by reconciling some of the core 
ideas behind these sampling systems. 
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6 Appendices 

The following analyses yield basic time and frequency domain descriptions of the sampling strategies. We 
employ standard Fourier transform properties without explicit explanation for the sake of conciseness. 
The notational style is that of [35]. To denote transform pairs, we use the shorthand notation, 



and use the abbreviations FT, FS, DTFT, and DFT when referring to the Fourier transform, the Fourier 
series, the discrete time Fourier transform, and the discrete Fourier transform, respectively. Also recall 
the definitions: sinc(x) = sin(x)/x, x G K and 



6.1 Derivations of Equations and (JH]) 

Time domain description. Let x(t) be a sparse multitone signal on [0,T] and recall the following 
transform pairs from Section 12.11 



x(t) <■ 



FT 



1 for - 1 < x < 1 



rect(x) 



otherwise 



x(t) ( FS:1/T > X(n) 
p{t) < FS;1/T ) P(n) 
h(t) - rcct(2^t - 1) H(u) = ^sinc(^o;)e- J '' 



By inspection of Figure l(a)| the time domain description of the RD is 



9(t) 



x(t)p(t) * h(t) 




T 



A I 




N/2-1 



where * denotes convolution. Sampling at t = [k + 1)^ for k = 0, 1, . . . , M — 1 yields 



y(k)=g{(k + l)%) 




(33) 
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where each step follows from the additivity of the integral and the specific nature of p(t). Evaluating 
the integral, we obtain 

e 3 - nT dT={ fiTtn (34) 

k M + W n = 

Substituting (|34|) into (|33|) and letting I = + to, f[33]) may be rewritten as 

N 

N/2-1 (fe+l)M"l 2 

y(k) = Yl <*(n)X(n) £ Pie?*" 1 , 

n=-N/2 ,_, N_ 

M 

for k = 0, . . . M — 1 where = pf^/PF) and 

T n^O 

j 27m 

1/VK n = 

Frequency domain description. We also have the following frequency domain description of the RD. 
Multiplication/Convolution: 

Ln-f J 
FS;1/T ^ 



x(t)p(t) < : > y P(m)X(n - to) 



m=-L»i--fJ+l 



Convolution (filtering) /Multiplication: 

g(t) = x(t)p(t) * h{t) 
X p S; i/T 

L«-f J 

G{n) = P{m)X{n - m)H{j^-n) 

m=-ln-% J+l 
T Ln-fJ 

= m ^ ^("W" - m)e-^*sinc(&n) 

m=-L"-fJ+l 

Sampling/ Aliasing: 

oo 

= + 1)£) < DFT;M > y(n) - M £ G(n - IM) 

I— — OO 

Because Y(n) is M -periodic, we can, without loss of information, restrict it to one period. This means 
we need only consider one term in the summation over I, Retaining the / = term yields 

oo 

Y(n) = T Y P(m)e- j ^- n smc(^n)X(n-m), 

for n = 0, . . . , M - 1. 



m— — oo 
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6.2 Derivation of Equation (JT 



Let x(t) be a sparse multiband signal and recall the following transform pairs from Section 



x(t) X(w) 

FSiW'/L' 

Pi(t) < > Pi{m) 

h(t) = ^sinc(^i) H(u) = rect(^). 



We then have the following time and frequency domain descriptions for the i th channel, i = 0, . 
of the MWC. 

Multiplication/Convolution: 

oo 

FT x — ^ 

x(t)pi(t) < > y Pi(m)X(ui — mujp) 



m— — oo 



l(u+irW')/u p \ 
= Pi(m)X(uj — muip) 

m=r(w-7rH")/^pl+l 

where lo p — 2-kW' / L' radians per second. The summation limits are finite for a given u) because x(t) is 
assumed to be bandlimitcd. 

Convolution (filtering) /Multiplication: 

9i(t) = x(t) Pi (t) * h(t) 

I FT 

\_(u + irW)/u p \ 

Gi(ui) = Pi(m)X(u) — mu p )H(uj) 

m= [(w-rr W )/<JJp] +1 

= ^2 Pi{m)X(uj - muj p )iect(^) 

Note that the low pass filter windows X(u>) and its translates (i.e., restricts them to the interval 
[— LUp/2, ujp/2]) and hence removes the dependence on ui in the summation limits. 



Sampling/ Aliasing: 



, Tirl ™ 

Vi(k) = 9i{kT s ) < DTFT > Y^wr) = £ G,(w+nw.) (35) 

n— — oo 

Observe that the translates of Gi(uj) in (f3"B"|) do not overlap because Gi{w) is bandlimited to [—u> p /2, ui p /2] 
and by assumption uj p < uj s (see Section 12. 2[) . We can therefore, without loss of information, restrict 
Y z (e ju] — ) to one period (Y^e^w 7 ) is 27rH"/M'-periodic) . This means we need only consider one term 
in the summation over n in (|35l) . We choose to retain the n = term and thus have the DTFT 

y,(e^#0rect(^) = 
Li(i'+i)J 

M' 



— Y, Pi{m)X{u-mu p )Te C t{^). 



m=-Li(L' + l)J+l 
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The Fourier series coefficients of Pi(t) can now be directly computed, 

Pi(m) = ^- f TP Mt) e~ 3 ^ mt dt 
J-P Jo 

^ 1=0 Jl — 



T7l2i=o Pu, m = 0, 



(36) 



where pu = p l (t) for t 6 [l/W, {I + 1)/W), to obtain 



where 



Li(i'+i)J L'-l 
£ /3(m) X(u - m^)rect(^ 7W ) £ p« e" 
m= _L|(i'+i)J+i '=o 

(W' l-e-^ m 

1 m / 

/3(m) = I M> j2Trm r . 

[l/L', m = 

6.3 Derivation of Equation ( 1241) 

Let :c(i) be a sparse multitone signal on [0,T] and recall the following transform pairs and parameter 
relations from Section 14.11 

x(t) ( FS;1/T > X(n) 
( FS;1/Tp > P<(m) 

fc( t ) = lW s - mc{IL W t) = rect ( J^) 



Also recall the simplifying assumption L' = M' . In the following analysis, we use the periodic extension 
of x(t), denoted by x(t), because it is defined for all t £ R (thereby simplifying calculations) and has 
the same FS coefficients as x(t). Its use does not imply that x(t) must be replicated before the MWC 
samples it. Rather it implies a discretisation step of the frequency axis that must otherwise be explicitly 
carried out to form the MMV problem in (1231) . 
Multiplication/Convolution: 

L£(«+^)J 

x{t)pi{t) < FS:1/T ) ^ P l {m)X{n-Nm) 

m=r^(™-^)i+i 

Note that the left hand side is defined for all i S R and the right is defined for all n G Z. The summation 
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limits are finite (for a given n) because the harmonics of x(t) are assumed to be bounded. 
Convolution (filtering) /Multiplication: 

gi (t)=x(t) P i(t)*h(t) 

X FS;1/T 

Gi{n)= Pi(m)X(n- Nm)H(^n), 

m=r-^(n-^)i+i 
L|(i'+i)J 

= Y Pi{m)X(n- jVm)rect(j^n) 

m=-L^(i'+l)J+l 



where we note that gi{t) is 1/T-periodic and iJ(^n) = -ff (w)| w=27m /T- Note that the low pass filter 

L' ' L' 



windows X(n) and its translates, i.e., restricts them to the interval [— -^t-, 2 ^"] (or [— iV/2, iV/2]) and 



hence removes the dependence on n in the summation limits. 
Sampling/ Aliasing: 

i 00 

Vi(k) = 9i(kT s ) < DFT;JV > y,(n) = — J] G «( n - Z7V ) ( 37 ) 

/— — 00 

Because Gi{n) is "bandlimited" to [— N/2, N/2], its translates in f|37jl do not overlap. Consequently, it 
is sufficient to only consider one period of Yi (n) , or equivalently, only one term in the above summation. 
Choosing the I — term, (|37p becomes 

Li(£'+i)J 

Y ^ = N ^ Pi(m)X(n-Nm)iect(% n ), (38) 

m=-L|(L' + l)J+l 

for —N/2 <n< N/2. By substituting the expression for Pi(m) from (f3"r?| into (|38p . we have the relation 

L|(i'+i)J L'-l 
*i(n) = ^ »j(m) X(n - 7Vm)rect(f n) £ pu e~*& lm , 

m= _Li(z,'+i)j+i 1=0 

where p u = p l {t) for t 6 [l/W, (I + 1)/W) and 

1 1 - e -ii7™ 



■ m ^ 

77(771) = { N j2irm 

1/L' m = 
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